It was the training data pruning too!

Authors

  • Pramod Kaushik Mudrakarta
  • Ankur Taly
  • Mukund Sundararajan
  • Kedar Dhamdhere
Abstract

We study the current best model (KDG; Krishnamurthy et al., 2017) for question answering on tabular data, evaluated on the WIKITABLEQUESTIONS dataset. Previous ablation studies of this model attributed its performance to certain aspects of its architecture. In this paper, we find that the model’s performance also depends crucially on a certain pruning of the data used to train it. Disabling the pruning step drops the model’s accuracy from 43.3% to 36.3%. The large impact on the performance of the KDG model suggests that the pruning may be a useful pre-processing step in training other semantic parsers as well.


Publication date: 2018